Awni00's group workspace
Abstractor - L=1, d=128, h=8
What makes this group special?
Tags
expert-hill-9
Notes
Author
State
Finished
Start time
August 2nd, 2024 4:16:26 PM
Runtime
5h 9m 25s
Tracked hours
-
Run path
dual-attention/dual_attention--math--algebra__sequence_next_term/098r9psq
OS
Linux-4.18.0-477.51.1.el8_8.x86_64-x86_64-with-glibc2.28
Python version
3.11.7
Git repository
git clone https://www.github.com/awni00/abstract_transformer
Git state
git checkout -b "expert-hill-9" 9ce16be9a1cd1fba943e52654fc94c965461f2e0
Command
/gpfs/radev/project/lafferty/ma2393/abstract_transformer/experiments/math/train_abstractor_model.py --task algebra__sequence_next_term --n_epochs 100 --batch_size 512 --d_model 128 --dff 256 --symbol_type symbolic_attention --n_layers 1 --n_heads 8
System Hardware
| CPU count | 32 |
| Logical CPU count | 32 |
| GPU count | 1 |
| GPU type | NVIDIA A40 |
W&B CLI Version
0.17.5
Config
Config parameters are your model's inputs. Learn more
- {} 12 keys▶
- {} 12 keys▶
- 128
- {} 7 keys▶
- {} 6 keys▶
- "Abstractor - L=1, d=128, h=8"
- 161
- {} 2 keys▶
- "token"
- 85
- 1
- 1
- 31
- 85
- {} 2 keys▶
- "token"
- 85
Summary
Summary metrics are your model's outputs. Learn more
- {} 10 keys▶
- 99
- 0.06803567707538605
- 1.3182429075241089
- 0.907868504524231
- 0.06803567707538605
- 1.3182429075241089
- 0.907868504524231
- 0.05778918042778969
- 0.8962574601173401
- 390,599
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...